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DETAILED ACTION 



Claim Rejections - 35 USC § 103 



1 . The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

2. Claims 1-20 are rejected under 35 U.S.C. 103(a) as being unpatentable over Vu 
et al (U.S. Patent No. 6,393,427, hereinafter, "Vu") in view of Myers et al (U.S. Patent 
No. 6,374,274, hereinafter, "Myers"). 

With respect to claims 1, 13-14 and 18, Vu discloses the method for 
automatically generating a query, 

defining an organized classification of document content with each class in the 
organized classification of document content having associated therewith a 
classification label, see (col. 3, lines 41-67, col. 1, lines 22-67, col. 6, lines 21-31); 

each classification label corresponding to a category of information in an 
information retrieval system, see (col. 3, lines 41-67, col. 2, lines 2-54, col. 6, lines 21- 
31, Note: document classification results are saved in database based on category); 
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assigning the selected document content a classification label from the organized 
classification of content, see (col. 8, lines 16-65, col. 6, lines 21-31); 

automatically formulating a query that restricts a search at the information 
retrieval system for information concerning the set of entities (web and database are 
used as entities for query) to the category of information in the information retrieval 
system identified by the assigned classification label, see (col. 4, lines 55-67 to col. 5, 
lines 1-37). 

Vu does not explicitly discloses the identifying a set of entities in selected 
document content for searching additional information related to using the information 
retrieval system. However, Myers teaches that documents are identified by each entity, 
which teaches the entities are identified for document classification, see (col. 2, lines 
28-43 and abstract). Therefore, it would have been obvious a person having ordinary 
skill in the art the time invention was made to include the identifying a set of entities in 
selected document content for searching additional information related to using the 
information retrieval system of Vu to have additional information that is document 
content entity for document classification. Because document content entity is one of 
the basic information for document classification or document clustering. 

With respect to claims 2, Vu discloses that limiting the query by adding terms 
(keywords) relating to context information surrounding the set of entities in the selected 
document content, see (col. 5, lines 1-36). 
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With respect to claim 3, Vu discloses that the number of terms added is limited to 
a predefined number, see (col. 7, lines 9-67 to col. 8, lines 1-65). 

With respect to claims 4, Vu discloses that limiting the query by adding terms 
defining the assigned classification label, see (FIG. 3, col. 4, lines 19-67 to col. 5, lines 
1-37). 

With respect to claim 5, Vu discloses that the organized classification of 
document content is defined using a hierarchical organization, see (FIG. 4, col. 6, lines 
1-61). 

With respect to claim 6, Vu discloses that using a text categorizer (classification 
based on keywords) to assign the classification label assigned from the organized 
classification of content, see (FIG. 2-4, col. 4, lines 19-67 to col. 5, lines 1-37). 

With respect to claim 7, Vu discloses that extracting with the text categorizer a 
set of terms relating to the document content; and appending to the query ones of the 
set of terms extracted by the text categorizer to contextualize the query, see (FIG. 2-4, 
col. 4, lines 19-67 to col. 5, lines 1-37). 
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With respect to claim 8, Vu discloses that abbreviating the set of terms extracted 
by the text categorizer to a predefined limit of terms, see (FIG. 2-4, col. 4, lines 19-67 to 
col. 5, lines 1-37). 

With respect to claim 9, Vu discloses that extracting noun phrases from the 
selected document content, see (col. 4, lines 55-67 to col. 5, lines 1-37); 

ranking the noun phrases by those that occur most frequently in the document 
content, see (col. 4, lines 55-67 to col. 5, lines 1-37); 

defining a subset of noun phrases by identifying those ranked noun phrases that 
occur more frequently than a first predefined frequency, see (col. 4, lines 55-67 to col. 5, 
lines 1-37); ranking those words in the subset of noun phrases by their frequency of 
occurrence to define an ordered list of words; defining a subset of the ordered list of 
words by identifying those ranked words that occur more frequently than a second 
predefined frequency; re-ranking the subset of words in inverse frequency to their use in 
the category of information in the information retrieval system identified by the assigned 
classification label; using only those highest ranked words in the re-ranked subset of 
words-to define the set of terms appended to the query, see (col. 4, lines 19-67 to col. 5, 
lines 1-37, Note: extracting most frequently used keywords are used to rank document 
classification). 
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With respect to claims 10-11, Vu discloses that each class in the organized 
classification of document content has associated therewith a characteristic vocabulary, 
see (col. 4, lines 19-67 to col. 5, lines 1-37). 

With respect to claim 12, Vu discloses that enriching selected content of a 
document with personalities that identify enrichment themes, see (FIG. 9B, col. 4, lines 
19-45). 

With respect to claims 15-16, Vu discloses that short length aspect vector 
generator for generating terms relating to context information surrounding the set of 
entities in the selected document content; wherein the query generator adds the terms 
relating to the context information to limit the query, see (col. 4, lines 19-67 to col. 5, 
lines 1-37). 

With respect to claim 1 7, Vu discloses that content manager for enriching the 
selected document content with results provided from the information retrieval system 
using the query, see (col. 4, lines 19-67 to col. 5, lines 1-37, Note: document modified 
by doing filtering and morphing, which is enriching documents). 

With respect to claims 19, Vu discloses that limiting the query by adding terms 
(keywords) relating to context information surrounding the set of entities in the selected 
document content, see (col. 5, lines 1-36). 
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With respect to claims 20, Vu discloses that limiting the query by adding terms 
defining the assigned classification label, see (FIG. 3, col. 4, lines 19-67 to col. 5, lines 
1-37). 

Conclusion 

The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

Smiley (U.S. Patent No. 5,978,81 1 ) discloses the system for data model for an 
information repository models data as objects, the relationships or interdependences 
between the data, their physical storage or access information and rules or methods of 
accessing the data. 

Johnson et al (U.S. Patent No. 6,212,532) discloses the system for information 
extraction system capable of extracting information from natural language documents. 
The system includes a plurality of interchangeable modules including a data preparation 
module for preparing a first set of raw data having class labels to be tested, the data 
preparation module being selected from a first type of the interchangeable modules. 
The system further includes a feature extraction module for extracting features from the 
raw data received from the data preparation module and storing the features in a vector 
format, the feature extraction module being selected from a second type of the 
interchangeable modules. A core classification module is also provided for applying a 



* 
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learning algorithm to the stored vector format and producing therefrom a resulting 
classifier, the core classification module being selected from a third type of the 
interchangeable modules. A testing module compares the resulting classifier to a set of 
preassigned classes, where the testing module is selected from a fourth type of the 
interchangeable modules, where the testing module tests a second set of raw data 
having class labels received by the data preparation module to determine the degree to 
which the class labels of the second set of raw data approximately corresponds to the 
resulting classifier. 

Byrd, Jr. et al (U.S. Patent No. 5,832,480) discloses the system for Descriptive 
canonical forms of entity types are created by scanning one or more documents in a 
database of a computer system to identify one or more proper names that appear in the 
documents as raw names. Each of the raw names has zero or more proper names, zero 
or more medial substrings, zero or more leading substrings, and zero or more trailing 
substrings. The raw names of one or more documents are "cleaned" and "split" until 
certain "cleaning and splitting conditions" are no longer met to obtain a list of clean and 
split candidate names. Anchor names are selected from the list that unambiguously 
represent an entity type. The anchor names have one or more entity-type attribute 
values. Variant names, clean and split candidate names having one or more shared 
attribute (values) with the anchor name, are combined with the anchor name to create 
an equivalence group of names that refer to the same entity. A canonical form is 
generated for the group from a subset of the anchor name attributes. A canonical form 
is created in this manner for all of the clean and split candidate names on the list. 
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Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Isaac M Woo whose telephone number is (703) 305- 
0081 . The examiner can normally be reached on 8:00-5:00. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Kim Y Vu can be reached on (703) 305-4393. The fax phone numbers for 
the organization where this application or proceeding is assigned are (703) 746-7239 for 
regular communications and (703) 308-6606 for After Final communications. 

Any inquiry of a general nature or relating to the status of this application or 
proceeding should be directed to the receptionist whose telephone number is (703) 305- 
3900. 
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December 13, 2002 




